Abstract. By disintegration of transport plans we introduce the notion of transport
class. This allows one to consider the Monge problem as a particular case of the
Kantorovich transport problem, once a transport class is fixed. The transport problem
constrained to a fixed transport class is equivalent to an abstract Monge problem
over a Wasserstein space of probability measures. Concerning solvability of this kind
of constrained problems, it turns out that in some sense the Monge problem corresponds
to a lucky case.

Introduction

Optimal transport problems, also known as Monge-Kantorovich problems, have been
intensively studied in recent years, giving rise to numerous and important applications
to PDE, Shape Optimization and Calculus of Variations, so that the field has witnessed
a spectacular development. The interested reader may look at the monographs and
lecture notes [3], El, [11], [HI, [19], [20], [21], [22], where the subject is fully developed.

Let us briefly recall the formulations of the Monge-Kantorovich problems.

Let $X$, $Y$ be two compact metric spaces and let $c : X \times Y \to \mathbb{R}_+$ be a Borel cost
function. The Monge problem is formulated as follows: given two probability measures
$\mu \in \mathcal{P}(X)$, $\nu \in \mathcal{P}(Y)$, find a measurable map $t : X \to Y$ such that $t_\#\mu = \nu$ ($\#$ denotes
the push-forward of measures) and such that $t$ minimizes the total cost, i.e.

\[ \min_{t : X \to Y} \left\{ \int_X c(x, t(x))\, d\mu \;:\; t_\#\mu = \nu \right\}. \tag{0.1} \]

It may happen that the set of admissible maps is empty (e.g. $\mu = \delta_x$ and
$\nu = \frac{1}{2}(\delta_y + \delta_z)$). Then the problem can be reformulated in its Kantorovich relaxation:
find $\gamma \in \mathcal{P}(X \times Y)$ such that $\pi^1_\#\gamma = \mu$ and $\pi^2_\#\gamma = \nu$ ($\pi^1$ and $\pi^2$ are the projections on
the factors of $X \times Y$) and such that $\gamma$ minimizes the total cost, i.e.

\[ \min \left\{ \int_{X \times Y} c(x, y)\, d\gamma(x, y) \;:\; \pi^1_\#\gamma = \mu,\ \pi^2_\#\gamma = \nu \right\}. \tag{0.2} \]

The admissible measures $\gamma$ for the Kantorovich problem are called transport plans. We
denote by $\Pi(\mu, \nu)$ the set of transport plans with marginals $\mu$ and $\nu$. If $t$ is admissible
for the Monge problem then the measure associated in the usual way to the graph
of $t$, i.e. $\gamma = (I_X \times t)_\#\mu$, is admissible for the Kantorovich problem. However, the
class of admissible measures for the Kantorovich problem is never empty, as it contains
$\mu \otimes \nu$. Moreover, the Kantorovich problem is a linear one. Existence of minimizers for
the Monge problem is difficult and may fail, while for the Kantorovich problem the
lower semicontinuity of $c$ is enough to ensure existence of minimizers.
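If $X = Y$ and $c = d$ is the distance function, for $p \ge 1$ the cost

\[ W_p(\mu, \nu) = \left( \min \left\{ \int_{X \times X} d^p(x, y)\, d\gamma(x, y) \;:\; \gamma \in \Pi(\mu, \nu) \right\} \right)^{1/p} \]

defines a distance on $\mathcal{P}(X)$ called the $p$-Wasserstein distance.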
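As a purely illustrative special case (not taken from the paper), the $p$-Wasserstein distance can be computed by hand when both measures are uniform empirical measures on the real line with the same number of atoms: for $p \ge 1$ an optimal plan pairs the sorted atoms in order. The function name `wasserstein_1d` and the sample data are assumptions made for this sketch.

```python
def wasserstein_1d(xs, ys, p=1):
    """p-Wasserstein distance between two uniform empirical measures on the
    real line with the same number of atoms (illustrative special case)."""
    if len(xs) != len(ys) or not xs:
        raise ValueError("need two non-empty samples of equal size")
    # On the line, for p >= 1, pairing the sorted atoms in order is optimal.
    pairs = zip(sorted(xs), sorted(ys))
    cost = sum(abs(x - y) ** p for x, y in pairs) / len(xs)
    return cost ** (1.0 / p)


# Hypothetical data: each atom carries mass 1/2.
print(wasserstein_1d([0.0, 1.0], [1.0, 2.0], p=1))
print(wasserstein_1d([0.0, 1.0], [0.0, 3.0], p=2))
```

In general (different weights, higher dimension) the minimum in the definition is a linear program over $\Pi(\mu,\nu)$ and this shortcut no longer applies.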

Let us recall that by Kantorovich duality (see [3j [HJ (2TJ [22]) the 1-Wasserstein
distance between $\mu$ and $\nu$, which we will simply denote by $W$, can be expressed as
follows

\[ W(\mu, \nu) = \sup \left\{ \int_X \varphi\, d(\mu - \nu) \;:\; \varphi \in \mathrm{Lip}_1(X) \right\}, \tag{0.3} \]

where $\mathrm{Lip}_1(X)$ denotes the set of Lipschitz functions having Lipschitz constant not
greater than one.

Description of the results. A relevant tool in mass transportation theory is the
Disintegration Theorem (Theorem 1.1) of measures, which states that
every transport plan $\gamma \in \mathcal{P}(X \times Y)$ can be written as $\gamma = f(x) \otimes \mu$ where $f(x) \in \mathcal{P}(Y)$.
We shall call disintegration map every $f : X \to (\mathcal{P}(Y), W)$ such that



TRANSPORT AND DISINTEGRATIONS 






In this paper we relate the structure of the set of transport plans $\Pi(\mu, \nu)$ with
the push-forward of disintegration maps. Indeed, given the measure $\gamma = f(x) \otimes \mu$,
obviously $\mu$ is the first marginal of $\gamma$, while the second marginal depends on the
disintegration map $f$. An interesting feature of transport plans appears by looking at
the measure $f_\#\mu$. Precisely, if $\eta = g(x) \otimes \mu$ is another transport plan, we have (see
Lemma 1.5):

\[ f_\#\mu = g_\#\mu \;\Longrightarrow\; \pi^2_\#\gamma = \pi^2_\#\eta. \]

Therefore, the second marginals can be fixed by looking at the push-forward of
disintegration maps. In this way the set of transport plans $\Pi(\mu, \nu)$ can be structured
in transport classes (see Definition 1.7) by setting $\eta \in [\gamma] \iff f_\#\mu = g_\#\mu$. Roughly
speaking (see Example 1.8), fixing a transport class leads to a constrained
transport problem with respect to splitting masses or traveling ones. Lemma 1.3 shows
that all transport plans induced by transport maps belong to the same transport class.
Moreover, by density of transport maps in $\Pi(\mu, \nu)$, it follows (see Proposition 1.9) that
such a transport class characterizes the transport plans induced by transport maps.
Therefore, in this perspective the Monge problem can be seen as a constrained
Kantorovich problem, namely

\[ \min \left\{ \int_X c(x, t(x))\, d\mu \;:\; t_\#\mu = \nu \right\} = \min \left\{ \int_{X \times Y} c(x, y)\, d\gamma \;:\; \gamma \in [\delta_s \otimes \mu] \right\}, \]

for a given transport map $s$. By density of transport maps, the Kantorovich transport
problem also corresponds to

\[ \min \left\{ \int_{X \times Y} c(x, y)\, d\gamma \;:\; \gamma \in \Pi(\mu, \nu) \right\} = \min \left\{ \int_{X \times Y} c(x, y)\, d\gamma \;:\; \gamma \in \overline{[\delta_s \otimes \mu]}^{\,W} \right\}, \]

for a given transport map $s$.

Hence the Monge problem represents a particular case of a richer structure of
problems, obtained by fixing a transport class in the Kantorovich formulation of
transport problems. In this context, considering transport problems in a fixed transport
class is quite natural. Since we are looking at the push-forward through disintegration
maps, fixing a transport class is equivalent (see Section 2) to considering measures
$\Lambda \in \mathcal{P}(\mathcal{P}(Y))$ satisfying the barycenter constraint

\[ \int_{\mathcal{P}(Y)} \lambda\, d\Lambda(\lambda) = \nu. \]

The corresponding transport class is given by the transport plans $f(x) \otimes \mu$ such that
$f_\#\mu = \Lambda$. In this formulation we see that fixing a transport class is equivalent to
considering transport maps $f$ sending $\mu$ into $\Lambda$. So it is natural to consider the following
Monge-Kantorovich problem in the class $\Lambda$:



\[ MK_\Lambda(c, \mu, \nu) := \inf \left\{ \int_{X \times Y} c(x, y)\, d\gamma \;:\; \gamma = f \otimes \mu,\ f_\#\mu = \Lambda \right\}. \tag{0.4} \]

The above transport problem leads to an abstract Monge problem between
the space $X$ and $\mathcal{P}(Y)$. Let us consider the following transport cost

\[ \forall (x, \lambda) \in X \times \mathcal{P}(Y) : \quad \tilde{c}(x, \lambda) = \int_Y c(x, y)\, d\lambda. \]

We set

\[ M(\tilde{c}, \mu, \Lambda) := \inf \left\{ \int_X \tilde{c}(x, f(x))\, d\mu \;:\; f_\#\mu = \Lambda \right\}. \tag{0.5} \]

For every transport class $\Lambda$ (see Proposition 2.2) we have

\[ M(\tilde{c}, \mu, \Lambda) = MK_\Lambda(c, \mu, \nu). \]

Therefore, every existence result for the Monge problem $M(\tilde{c}, \mu, \Lambda)$ in the abstract
setting corresponds to an existence result for the Monge-Kantorovich problem in the
transport class $\Lambda$. Minimizing in a transport class can be as difficult as
for the Monge problem since the transport classes are not, in general, closed.
However, this reformulation shows that the Monge case is peculiar. More
precisely, since the Monge problem is a particular case of transportation in a transport
class, one may ask what happens for other transport classes. In other words, the
question is whether the abstract Monge problem admits solutions. The
existence results for the Monge problem are usually stated in the following form: under
some assumptions on the spaces, on the first marginal $\mu$ and on the cost $c(x, y)$, for
every second marginal $\nu$ the Monge problem admits solutions. For the abstract Monge
problem $M(\tilde{c}, \mu, \Lambda)$ this is not the case. For discrete measures $\Lambda$ (see Section 4) and
the quadratic cost, $M(\tilde{c}, \mu, \Lambda)$ may fail to admit solutions. Therefore, in
the abstract setting, it is also interesting to ask, under some assumptions
on the spaces, on the first marginal $\mu$ and on the cost $c(x, y)$, for which
second marginals the corresponding Monge problem admits solutions. From
this point of view, in some sense the Monge problem is a lucky case.

1. Disintegration maps and transport classes 

Let $X, Y \subset \mathbb{R}^N$ be two compact sets and let $\mathcal{M}(Y)$ be the space of Radon measures
on $Y$. A map $\lambda : X \to \mathcal{M}(Y)$ is said to be Borel, or equivalently weakly*-measurable,
if for any open set $B \subset Y$ the function $x \in X \mapsto \lambda_x(B)$ is a real valued Borel map.
Equivalently, $x \mapsto \lambda_x$ is a Borel map if, for any Borel and bounded map $\varphi : X \times Y \to \mathbb{R}$,
the map

\[ x \in X \mapsto \int_Y \varphi(x, y)\, d\lambda_x \]

is Borel.

Theorem 1.1 (Disintegration theorem). Let $\gamma \in \mathcal{P}(X \times Y)$ be given and let
$\pi^1 : X \times Y \to X$ be the first projection map of $X \times Y$; we set $\mu = (\pi^1)_\#\gamma$. Then for
$\mu$-a.e. $x \in X$ there exists $\nu_x \in \mathcal{P}(Y)$ such that

(i) the map $x \mapsto \nu_x$ is Borel,

(ii) $\forall \varphi \in C_b(X \times Y)$ : $\int_{X \times Y} \varphi(x, y)\, d\gamma = \int_X \left( \int_Y \varphi(x, y)\, d\nu_x(y) \right) d\mu(x)$.

Moreover the measures $\nu_x$ are uniquely determined up to a negligible set with respect
to $\mu$.

Let $\gamma \in \Pi(\mu, \nu)$; as usual we will write $\gamma = \nu_x \otimes \mu$, assuming that the $\nu_x$ satisfy
conditions (i) and (ii) of Theorem 1.1. Obviously, the transport plan $\mu \otimes \nu$ corresponds
to the constant map $x \mapsto \nu_x = \nu$. Let $t : X \to Y$ be a transport map; observe that for
the transport plan $\gamma_t := (I \times t)_\#\mu$, the Disintegration Theorem yields $\gamma_t = \delta_{t(x)} \otimes \mu$.
Therefore, the disintegration procedure for a transport plan $\gamma = f(x) \otimes \mu$ produces a
map

\[ f : X \to (\mathcal{P}(Y), W), \quad \text{s.t. } x \mapsto f(x) \text{ is Borel}. \tag{1.1} \]

We shall refer to such maps $f$ as disintegration maps. For a transport map $t$ the
corresponding disintegration map is given by $x \mapsto \delta_{t(x)}$. Of course, it is possible to
look at a disintegration map as a measurable map between $X$ and $(\mathcal{P}(Y), W)$. Indeed,
we have the following result.

Lemma 1.2. A map $f : X \to (\mathcal{P}(Y), W)$ is a disintegration map if and only if $f$ is
measurable.

Proof. Let $f : X \to (\mathcal{P}(Y), W)$ be measurable and let $A \subset Y$ be an open set. Observe
that $f(x)(A) = \int_Y \chi_A(y)\, df(x)$. For a l.s.c. function $\varphi$ over $Y$, define

\[ I_\varphi : (\mathcal{P}(Y), W) \to \mathbb{R}, \qquad I_\varphi(\lambda) := \int_Y \varphi(y)\, d\lambda. \tag{1.2} \]

Since $W$ metrizes the weak* topology of measures, $I_\varphi$ is a l.s.c. map.
For every $x \in X$ we have

\[ \int_Y \varphi(y)\, df(x) = I_\varphi(f(x)). \]

If $f$ is a measurable map, it follows that the map $f(\cdot)(A) : X \to \mathbb{R}$ is Borel as the
composition of a l.s.c. map and a measurable one. Hence $f$ is a disintegration map.
Vice versa, observe that by the Ascoli-Arzelà Theorem the space $\mathrm{Lip}_1(Y)$ is compact
with respect to uniform convergence. Fixing a countable dense subset $D \subset \mathrm{Lip}_1(Y)$,
by Kantorovich duality we have

\[ W(\nu_1, \nu_2) = \sup_{u \in \mathrm{Lip}_1(Y)} \int_Y u\, d(\nu_1 - \nu_2) = \sup_{u \in D} \int_Y u\, d(\nu_1 - \nu_2). \]

Since $x \mapsto f(x)$ is Borel, for every $u \in \mathrm{Lip}_1(Y)$ the map $g_u : X \to \mathbb{R}$ defined by
$g_u(x) := \int_Y u\, d(\nu - f(x))$ is a Borel map. To check that $f$ is a measurable map, it is
sufficient to observe that

\[ f^{-1}(B(\nu, r)) = \bigcap_{u \in D} g_u^{-1}((-r, r)) =: A. \]

Indeed, if $x \in A$ we get $|g_u(x)| < r$ for every $u \in D$. Hence, by definition of $g_u$, it
follows that $W(\nu, f(x)) < r$ and then $f(x) \in B(\nu, r)$. On the other hand, if $f(x) \in
B(\nu, r)$, i.e. $W(\nu, f(x)) < r$, by Kantorovich duality $|g_u(x)| := \left| \int_Y u\, d(\nu - f(x)) \right| < r$
for every $u \in D$. This implies $x \in A$. □

Let $X \subset \mathbb{R}^N$; we recall that the barycenter of a measure $\mu \in \mathcal{P}(X)$ is given by

\[ \beta(\mu) = \int_X x\, d\mu. \]

Disintegration maps naturally produce measures of the form $f_\#\mu$ on the space
$(\mathcal{P}(Y), W)$. By the following lemma, we see that this point of view is equivalent
to fixing the second marginal of transport plans induced by transport maps.

Lemma 1.3. Let $t, s : X \to Y$ be two given Borel maps, $\mu \in \mathcal{P}(X)$ and let $f, g : X \to
\mathcal{P}(Y)$ be defined by $f(x) = \delta_{t(x)}$, $g(x) = \delta_{s(x)}$. Then

\[ t_\#\mu = s_\#\mu \iff f_\#\mu = g_\#\mu. \tag{1.3} \]

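Proof. Assume $f_\#\mu = g_\#\mu$ and for any $\varphi \in C(Y)$ let us consider the function $I_\varphi$
defined in (1.2). Observe that $I_\varphi \in C((\mathcal{P}(Y), W))$. Hence we have

\[ \int_{\mathcal{P}(Y)} I_\varphi(\lambda)\, d(f_\#\mu) = \int_{\mathcal{P}(Y)} I_\varphi(\lambda)\, d(g_\#\mu) \iff \int_X I_\varphi(f(x))\, d\mu = \int_X I_\varphi(g(x))\, d\mu \]
\[ \iff \int_X \left( \int_Y \varphi(y)\, df(x) \right) d\mu = \int_X \left( \int_Y \varphi(y)\, dg(x) \right) d\mu. \tag{1.4} \]

Since

\[ \int_Y \varphi(y)\, df(x) = \varphi(t(x)), \qquad \int_Y \varphi(y)\, dg(x) = \varphi(s(x)), \]

by (1.4) we get

\[ \int_X \varphi(t(x))\, d\mu = \int_X \varphi(s(x))\, d\mu. \]

By the arbitrariness of $\varphi$ we infer $t_\#\mu = s_\#\mu$.

Vice versa, for every $\psi \in C((\mathcal{P}(Y), W))$ let us consider the function $\varphi(y) = \psi(\delta_y)$.
Observe that $\varphi \in C(Y)$. If $t_\#\mu = s_\#\mu$ we compute

\[ \int_{\mathcal{P}(Y)} \psi\, d(f_\#\mu) = \int_X \psi(f(x))\, d\mu = \int_X \psi(\delta_{t(x)})\, d\mu = \int_X \varphi(t(x))\, d\mu = \int_X \varphi(s(x))\, d\mu = \int_{\mathcal{P}(Y)} \psi\, d(g_\#\mu). \]

By the arbitrariness of $\psi$ we obtain $f_\#\mu = g_\#\mu$. □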

Corollary 1.4. Let $t, s : X \to Y$ be two given Borel maps, $\mu \in \mathcal{P}(X)$, let $f, g : X \to
\mathcal{P}(Y)$ be defined by $f(x) = \delta_{t(x)}$, $g(x) = \delta_{s(x)}$ and let $\gamma = f(x) \otimes \mu$, $\eta = g(x) \otimes \mu$. Then

\[ \pi^2_\#\gamma = \pi^2_\#\eta \iff f_\#\mu = g_\#\mu. \tag{1.5} \]




Observe that the first part of the proof of the above Lemma works for general
transport plans $\gamma = f(x) \otimes \mu$, $\eta = g(x) \otimes \mu$. Actually, by (1.4) we get the following

Lemma 1.5. Let $\mu \in \mathcal{P}(X)$, $f, g : X \to \mathcal{P}(Y)$, $\gamma = f(x) \otimes \mu$, $\eta = g(x) \otimes \mu$ be given.
Then the following implication holds true

\[ f_\#\mu = g_\#\mu \;\Longrightarrow\; \pi^2_\#\gamma = \pi^2_\#\eta. \tag{1.6} \]

Therefore, also for transport plans, the second marginal can be fixed by fixing the
push-forward of disintegration maps.
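For finitely supported measures the implication of Lemma 1.5 can be checked by direct computation. The sketch below is only an illustration under stated assumptions: measures are stored as dictionaries `{atom: mass}`, a disintegration map is a dictionary `{x: measure on Y}`, and the three-point data and the helper names (`freeze`, `push_forward`, `second_marginal`, `barycenter`) are hypothetical choices, not notation from the paper.

```python
from collections import defaultdict
from fractions import Fraction as F


def freeze(measure):
    """Hashable form of a finitely supported measure {atom: mass}."""
    return frozenset((a, m) for a, m in measure.items() if m != 0)


def push_forward(mu, f):
    """The measure f_# mu on P(Y), stored as {frozen measure on Y: mass}."""
    lam = defaultdict(F)
    for x, mass in mu.items():
        lam[freeze(f[x])] += mass
    return dict(lam)


def second_marginal(mu, f):
    """Second marginal of the plan f (x) mu, i.e. sum_x mu(x) f(x)."""
    nu = defaultdict(F)
    for x, mass in mu.items():
        for y, w in f[x].items():
            nu[y] += mass * w
    return {y: m for y, m in nu.items() if m != 0}


def barycenter(lam):
    """Generalized barycenter of a measure on P(Y): sum of mass * measure."""
    nu = defaultdict(F)
    for frozen, mass in lam.items():
        for y, w in frozen:
            nu[y] += mass * w
    return {y: m for y, m in nu.items() if m != 0}


# Hypothetical data: the mass is split at x1 for f and at x2 for g.
mu = {"x1": F(1, 3), "x2": F(1, 3), "x3": F(1, 3)}
half = {"y1": F(1, 2), "y2": F(1, 2)}
dirac = {"y2": F(1)}
f = {"x1": half, "x2": dirac, "x3": dirac}
g = {"x1": dirac, "x2": half, "x3": dirac}

print(push_forward(mu, f) == push_forward(mu, g))
print(second_marginal(mu, f), second_marginal(mu, g))
```

The second marginal is recovered as the barycenter of the push-forward, which is why equal push-forwards force equal second marginals.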

Notice that in general the converse of (1.6) is not true, as we show in the next
example.

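Example 1.6. Let $f : X \to \mathcal{P}(Y)$ be defined by $f(x) = \nu$ and let $\gamma = f(x) \otimes \mu$. Let
$\eta = g(x) \otimes \mu$ where $g(x) = \delta_{t(x)}$ for a given transport map $t : X \to Y$ with $t_\#\mu = \nu$.
For every $\psi \in C((\mathcal{P}(Y), W))$ we have

\[ \int_{\mathcal{P}(Y)} \psi\, d(g_\#\mu) = \int_X \psi(\delta_{t(x)})\, d\mu, \]

while

\[ \int_{\mathcal{P}(Y)} \psi\, d(f_\#\mu) = \psi(\nu). \]

For any $\varphi \in C(Y)$ let us consider $\psi(\lambda) = \left| \int_Y \varphi(y)\, d\lambda \right| = |I_\varphi(\lambda)|$. We compute

\[ \int_X \psi(\delta_{t(x)})\, d\mu = \int_X \left| \int_Y \varphi(y)\, d\delta_{t(x)} \right| d\mu = \int_X |\varphi(t(x))|\, d\mu = \int_Y |\varphi(y)|\, d\nu. \]

However, on the other hand $\psi(\nu) = \left| \int_Y \varphi(y)\, d\nu \right|$. The two values differ in general, for
instance when $\varphi$ changes sign on the support of $\nu$; hence $f_\#\mu \neq g_\#\mu$ although $\gamma$ and $\eta$
have the same marginals.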
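The computation of Example 1.6 can be reproduced on a two-point space. This is a minimal sketch with hypothetical data: `mu` is uniform on two points, `t` sends them to two distinct points, and `phi` is a sign-changing test function; all names are illustrative.

```python
from fractions import Fraction as F

mu = {"x1": F(1, 2), "x2": F(1, 2)}
t = {"x1": "y1", "x2": "y2"}              # a transport map
nu = {"y1": F(1, 2), "y2": F(1, 2)}       # nu = t_# mu
phi = {"y1": 1, "y2": -1}                 # sign-changing test function on Y


def psi(lam):
    """psi(lambda) = | integral of phi against lambda |."""
    return abs(sum(phi[y] * w for y, w in lam.items()))


f = {x: nu for x in mu}                   # constant disintegration map
g = {x: {t[x]: F(1)} for x in mu}         # Dirac disintegration map of t


def integrate_psi(disintegration):
    """Integral of psi against the push-forward of mu by the given map."""
    return sum(mass * psi(disintegration[x]) for x, mass in mu.items())


def second_marginal(disintegration):
    out = {}
    for x, mass in mu.items():
        for y, w in disintegration[x].items():
            out[y] = out.get(y, F(0)) + mass * w
    return out


print(second_marginal(f) == second_marginal(g))  # same second marginal
print(integrate_psi(f), integrate_psi(g))        # different push-forwards
```

Both plans have second marginal `nu`, yet the test function `psi` separates the two push-forwards, so the plans lie in different transport classes.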

The above arguments allow us to characterize transport plans through the push-forward
of disintegration maps. We introduce the following notion of transport class.

Definition 1.7 (Transport classes). Let $\gamma, \eta \in \Pi(\mu, \nu)$ with $\gamma = f(x) \otimes \mu$, $\eta = g(x) \otimes \mu$
be given. We shall say that $\gamma$ and $\eta$ are equivalent (by disintegration), in symbols $\gamma \sim \eta$,
if $f_\#\mu = g_\#\mu$.

For any given $\eta \in \Pi(\mu, \nu)$ with $\eta = g(x) \otimes \mu$, we shall call transport class of $\eta$ the
equivalence class of transport plans given by

\[ [\eta] = \{ \gamma = f(x) \otimes \mu \;:\; f_\#\mu = g_\#\mu \}. \tag{1.7} \]

Notice that in the case of a discrete first marginal $\mu = \sum_i a_i \delta_{x_i}$, for any disintegration
map $f$ it is easily seen that

\[ f_\#\mu = \sum_i a_i\, \delta_{f(x_i)}. \]

Therefore, transport classes are fixed by the range of $f$.

Example 1.8. Let

\[ \mu = \tfrac{1}{3}\delta_{x_1} + \tfrac{1}{3}\delta_{x_2} + \tfrac{1}{3}\delta_{x_3}, \qquad \nu = \tfrac{1}{6}\delta_{y_1} + \tfrac{5}{6}\delta_{y_2}. \]

Figure 1.1. Transport plans in the same class.

Consider the transport plan which splits the mass only at $x_1$. This transport
plan corresponds to the disintegration map

\[ f(x_1) = 3(a\delta_{y_1} + b\delta_{y_2}), \quad f(x_2) = \delta_{y_2}, \quad f(x_3) = \delta_{y_2}, \quad a = b = \tfrac{1}{6}. \tag{1.8} \]

By changing the point at which the mass is split, the range of the corresponding
disintegration map does not change. For instance, for the second transport plan in
Figure 1.1 we get the following disintegration map

\[ g(x_1) = \delta_{y_2}, \quad g(x_2) = 3(a\delta_{y_1} + b\delta_{y_2}), \quad g(x_3) = \delta_{y_2}, \quad a = b = \tfrac{1}{6}. \]

It follows that $f_\#\mu = g_\#\mu$. Analogously, all the transport plans with only one split mass
belong to the same transport class.

On the other hand, by changing the number of split masses the corresponding
disintegration range changes.

Figure 1.2. Two splitting masses.

Indeed, by looking at Figure 1.2 we may consider the disintegration map

\[ h(x_1) = 3(a'\delta_{y_1} + b'\delta_{y_2}), \quad h(x_2) = 3(c'\delta_{y_1} + d'\delta_{y_2}), \quad h(x_3) = \delta_{y_2}, \]
\[ a' = \tfrac{3}{30}, \quad b' = \tfrac{7}{30}, \quad c' = \tfrac{2}{30}, \quad d' = \tfrac{8}{30}. \]

In such a case we have that $f_\#\mu \neq h_\#\mu$. On the other hand, by keeping fixed the
number of split masses, the transport class may be changed by modifying the
amount of traveling masses. Consider for instance the disintegration

\[ k(x_1) = 3(a''\delta_{y_1} + b''\delta_{y_2}), \quad k(x_2) = 3(c''\delta_{y_1} + d''\delta_{y_2}), \quad k(x_3) = \delta_{y_2}, \]
\[ a'' = \tfrac{\,\cdot\,}{30}, \quad b'' = \tfrac{\,\cdot\,}{30}, \quad c'' = \tfrac{\,\cdot\,}{30}, \quad d'' = \tfrac{\,\cdot\,}{30}. \]

We get $h_\#\mu \neq k_\#\mu$.

Therefore, fixing a transport class leads to a constrained transport problem,
with respect to splitting masses or traveling ones.

By Lemma 1.3 it follows that all transport plans induced by transport maps belong
to the same transport class. Since the transport maps are dense in $\Pi(\mu, \nu)$ we can
prove the following result.

Proposition 1.9. Let $s : X \to Y$ be a transport map, i.e. such that $s_\#\mu = \nu$, with $\mu$
non-atomic, and let $\eta = (I \times s)_\#\mu = \delta_{s(x)} \otimes \mu$. If $\gamma \in [\eta]$ then there exists a transport
map $t : X \to Y$ such that $\gamma = \delta_{t(x)} \otimes \mu$, i.e. the transport plan $\gamma$ is induced by a
transport map $t$. In particular, if $\gamma = f(x) \otimes \mu$, then $t(x) = \beta(f(x))$ $\mu$-a.e.

Proof. By applying [3, Theorem 9.3], see also [IT], and the same argument employed
in the proof of ([U Theorem 2.1]), we find a sequence of Borel maps $t_n : X \to Y$ such
that

\[ \gamma = \lim_{n \to +\infty} \delta_{t_n(x)} \otimes \mu, \qquad (t_n)_\#\mu = \nu \quad \forall n \in \mathbb{N}, \]

and therefore $\delta_{t_n(x)} \otimes \mu \in [\eta]$, $\forall n \in \mathbb{N}$. Consider $\varphi(y) = |y|^2$. By the push-forward
constraint we get

\[ \int_X |t_n(x)|^2\, d\mu = \int_X \varphi(t_n(x))\, d\mu = \int_Y \varphi(y)\, d\nu = \int_Y |y|^2\, d\nu < +\infty. \]

Let us consider $\psi \in C((\mathcal{P}(Y), W))$ defined by $\psi(\delta_y) = |y|^2$. In fact, denoting by
$\Delta \subset \mathcal{P}(Y)$ the set of Dirac deltas, the function $\psi$ is Lipschitz, with respect to the
Wasserstein distance, over $\Delta$. Hence it suffices to consider any Lipschitz extension of
$\psi$ to the whole $\mathcal{P}(Y)$. For every $n \in \mathbb{N}$, since $(\delta_{t_n})_\#\mu = (\delta_s)_\#\mu$ we have

\[ \int_X |t_n(x)|^2\, d\mu = \int_X \psi(\delta_{t_n(x)})\, d\mu = \int_X \psi(\delta_{s(x)})\, d\mu = \int_X |s|^2\, d\mu. \tag{1.9} \]

Therefore, by passing to a subsequence, we may suppose that $t_n$ is weakly convergent
and let $t$ be the weak limit of $t_n$.

Let $\gamma = f(x) \otimes \mu$. By definition of weak convergence, by approximation with continuous
functions, for any $g \in L^2(X, \mathbb{R}^N)$ we get

\[ \int_X \langle g, t \rangle\, d\mu = \lim_{n \to +\infty} \int_X \langle g, t_n \rangle\, d\mu = \lim_{n \to +\infty} \int_X \left( \int_Y \langle g, y \rangle\, d\delta_{t_n(x)}(y) \right) d\mu \]
\[ = \int_X \left( \int_Y \langle g, y \rangle\, df(x) \right) d\mu = \int_X \left\langle g, \int_Y y\, df(x) \right\rangle d\mu = \int_X \langle g, \beta(f(x)) \rangle\, d\mu. \]

Therefore $t_n \rightharpoonup \beta(f(x))$. On the other hand, since $\gamma \in [\eta]$, i.e. $f_\#\mu = (\delta_{t_n})_\#\mu$, and by
(1.9) we have

\[ \int_X |\beta(f(x))|^2\, d\mu = \int_X |\beta(\delta_{s(x)})|^2\, d\mu = \int_X |s|^2\, d\mu = \int_X |t_n|^2\, d\mu. \]

Then it follows that $t_n$ strongly converges to $\beta(f(x))$, hence by [3, Lemma 9.1] we
deduce that $\gamma = \delta_{t(x)} \otimes \mu$ with $t(x) = \beta(f(x))$. □

Of course, the above arguments hold true as well by considering $L^p(X, \mu)$ with $p > 1$.
This corresponds to considering the transport cost $d^p(x, y)$.

Proposition 1.9 allows us to reformulate the Monge transport problem as follows:

\[ \min \left\{ \int_X c(x, t(x))\, d\mu \;:\; t_\#\mu = \nu \right\} = \min \left\{ \int_{X \times Y} c(x, y)\, d\gamma \;:\; \gamma \in [\delta_s \otimes \mu] \right\}, \tag{1.10} \]

for a given transport map $s$.

By density of transport maps, the Kantorovich transport problem can be seen as

\[ \min \left\{ \int_{X \times Y} c(x, y)\, d\gamma \;:\; \gamma \in \Pi(\mu, \nu) \right\} = \min \left\{ \int_{X \times Y} c(x, y)\, d\gamma \;:\; \gamma \in \overline{[\delta_t \otimes \mu]}^{\,W} \right\}, \tag{1.11} \]

for a given transport map $t$.

Therefore, the Monge problem corresponds to minimizing the functional $\int_{X \times Y} c(x, y)\, d\gamma$
in a fixed transport class of $\Pi(\mu, \nu)$, while the Kantorovich one corresponds to minimizing
the same functional on the whole $\Pi(\mu, \nu)$.

2. Monge-Kantorovich problems on transport classes 

In the previous section we have seen that the Monge problem can be seen as a
particular case of minimization on a transport class. Since the transport classes are
determined through the push-forward of disintegration maps, they can be assigned by
probability measures $\Lambda$ over $(\mathcal{P}(Y), W)$.

Actually, consider $f \otimes \mu \in \Pi(\mu, \nu)$ and $\Lambda = f_\#\mu$. Since $(\pi^2)_\#(f \otimes \mu) = \nu$, for every
$\varphi \in C(Y)$ we have

\[ \int_Y \varphi(y)\, d\nu = \int_X \left( \int_Y \varphi(y)\, df(x) \right) d\mu = \int_X I_\varphi(f(x))\, d\mu = \int_{\mathcal{P}(Y)} I_\varphi(\lambda)\, d\Lambda(\lambda) = \int_{\mathcal{P}(Y)} \left( \int_Y \varphi(y)\, d\lambda \right) d\Lambda. \]

Therefore, in order to define a transport class, the measure $\Lambda$ has to satisfy the
constraint

\[ \int_{\mathcal{P}(Y)} \lambda\, d\Lambda = \nu. \tag{2.1} \]

Hence, every probability measure $\Lambda$ over $(\mathcal{P}(Y), W)$ satisfying (2.1) defines a transport
class $[\eta] = \{ f \otimes \mu \;:\; f_\#\mu = \Lambda \}$.

For instance, the transport class $[\mu \otimes \nu]$ corresponds to the measure $\Lambda = \delta_\nu$, while
for a transport map $t$, the transport class $[\delta_{t(x)} \otimes \mu]$ corresponds to $\Lambda = \int_X \delta_{\delta_{t(x)}}\, d\mu$.
On the other hand, the transport class in (1.8) corresponds to the discrete measure
$\Lambda = \frac{1}{3} \sum_i \delta_{f(x_i)}$. In this perspective, transport plans in the transport class $\Lambda$ can be
seen as transport maps between $\mu$ and $\Lambda$. It is then natural to consider the
Monge-Kantorovich problem in the class $\Lambda$ defined as follows

\[ MK_\Lambda(c, \mu, \nu) := \inf \left\{ \int_{X \times Y} c(x, y)\, d\gamma \;:\; \gamma = f \otimes \mu,\ f_\#\mu = \Lambda \right\}. \tag{2.2} \]

By Proposition 1.9 the Monge problem corresponds to the transport class $\Lambda = \int_X \delta_{\delta_{t(x)}}\, d\mu$.

Remark 2.1. Observe that for a discrete transport class $\Lambda = \sum_i \alpha_i \delta_{\lambda_i}$, the
Monge-Kantorovich problem in the class $\Lambda$ yields the optimal allocation problem consisting in
the determination of a Borel subdivision $\{U_i\}$ of $X$ minimizing

\[ \sum_i \int_{U_i} \left( \int_Y c(x, y)\, d\lambda_i \right) d\mu, \]

under the constraint $\sum_i \alpha_i \lambda_i = \nu$. For existence results in this framework see Section
4.

The notion of transport class leads naturally to an abstract Monge problem
between the space $X$ and $\mathcal{P}(Y)$. Consider the following transport cost

\[ \forall (x, \lambda) \in X \times \mathcal{P}(Y) : \quad \tilde{c}(x, \lambda) = \int_Y c(x, y)\, d\lambda. \tag{2.3} \]

We have the following

Proposition 2.2. For every transport class $\Lambda \subset \Pi(\mu, \nu)$ we have

\[ M(\tilde{c}, \mu, \Lambda) = MK_\Lambda(c, \mu, \nu). \]

Proof. It suffices to observe that for any disintegration map $f : X \to \mathcal{P}(Y)$ such that
$f_\#\mu = \Lambda$, we have

\[ \int_X \tilde{c}(x, f(x))\, d\mu = \int_X \left( \int_Y c(x, y)\, df(x) \right) d\mu = \int_{X \times Y} c(x, y)\, d(f \otimes \mu). \]



□ 
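The identity used in this proof is elementary in the finitely supported case and can be checked directly. The following sketch uses hypothetical data (a uniform measure on three real points, a quadratic cost, and one disintegration map that splits the mass at the first point); the names `abstract_cost`, `monge_value` and `kantorovich_value` are illustrative only.

```python
from fractions import Fraction as F


def c(x, y):
    """Quadratic cost on the real line (an illustrative choice)."""
    return (x - y) ** 2


def abstract_cost(x, lam):
    """Cost of sending x to the measure lam: integral of c(x, .) against lam."""
    return sum(w * c(x, y) for y, w in lam.items())


mu = {0: F(1, 3), 1: F(1, 3), 2: F(1, 3)}
f = {
    0: {0: F(1, 2), 3: F(1, 2)},   # the mass at x = 0 is split
    1: {3: F(1)},
    2: {3: F(1)},
}

# Abstract Monge functional: integral over X of the abstract cost.
monge_value = sum(mass * abstract_cost(x, f[x]) for x, mass in mu.items())

# Kantorovich functional of the plan f (x) mu on X x Y.
plan = {(x, y): mass * w for x, mass in mu.items() for y, w in f[x].items()}
kantorovich_value = sum(m * c(x, y) for (x, y), m in plan.items())

print(monge_value, kantorovich_value)
```

Both functionals are the same double sum, merely grouped differently, which is exactly the content of the displayed chain of equalities.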



Observe that by the above proof it follows that $f$ is a solution of $M(\tilde{c}, \mu, \Lambda)$ if and
only if $f \otimes \mu$ is a solution of $MK_\Lambda(c, \mu, \nu)$. Therefore, every existence result for the
Monge problem $M(\tilde{c}, \mu, \Lambda)$ in the abstract setting corresponds to an existence result
for the Monge-Kantorovich problem in the transport class $\Lambda$. The abstract setting has
the advantage of considering a nice cost, since it is linear with respect to the second
variable. Of course the disadvantage is the passage from the space $X \times Y \subset \mathbb{R}^N \times \mathbb{R}^N$
to the space $X \times \mathcal{P}(Y)$. Observe that by [TH Theorem 2.4] the transport classes are
always non-empty provided $\mu$ is non-atomic.

Proposition 2.3. $\tilde{c}$ is continuous, l.s.c., Carathéodory, normal iff $c$ is.

Here we say that a measurable map $c(x, y)$ is Carathéodory (resp. normal) if
$c(x, \cdot)$ is continuous (resp. l.s.c.).

Lemma 2.4. Let $c : X \times Y \to [0, +\infty]$ be a Borel cost function satisfying

\[ |c(x_1, y) - c(x_2, y)| \le \alpha(x_1 - x_2), \tag{2.4} \]

for a given map $\alpha : X \to \mathbb{R}$ continuous at $x = 0$ and such that $\alpha(0) = 0$. We have the
following

(1) If $c(x, \cdot)$ is continuous then $c$ (and hence $\tilde{c}$) is continuous.

(2) If $c(x, \cdot)$ is l.s.c. then $c$ (and hence $\tilde{c}$) is l.s.c.

Proof. Let $c(x, \cdot)$ be continuous. If $(x_n, y_n) \to (x, y)$ on $X \times Y$ we compute

\[ |c(x, y) - c(x_n, y_n)| \le |c(x, y) - c(x, y_n)| + |c(x, y_n) - c(x_n, y_n)| \le |c(x, y) - c(x, y_n)| + \alpha(x - x_n) \to 0, \]

as $n \to +\infty$. If $c(x, \cdot)$ is l.s.c., considering

\[ c(x_n, y_n) = c(x_n, y_n) - c(x, y_n) + c(x, y_n), \]

by (2.4), passing to the liminf we obtain

\[ \liminf_{n \to +\infty} c(x_n, y_n) = \liminf_{n \to +\infty} c(x, y_n) \ge c(x, y). \]

□



In general, for the existence of optimal transport plans in the Kantorovich problem
at least the lower semicontinuity of the cost function is usually required.
Actually, some regularity of the cost function is needed to obtain a useful duality
theory or to ensure that the infimum of the Kantorovich problem is equal to the
infimum of the Monge problem (see for instance [5J HH]). However, it is not hard to
verify that the Kantorovich problem admits solutions under weaker requirements
on the cost function (see for instance [7]).

For the reader's convenience we provide here some details based on disintegration maps.
Let $\pi_n = f_n \otimes \mu$ be a minimizing sequence for the Monge-Kantorovich problem (0.2).
Passing to a subsequence we may suppose that $\pi_n \rightharpoonup f \otimes \mu = \pi \in \Pi(\mu, \nu)$. Now, for
any $\psi(x) \in C(X)$, $\varphi(y) \in C(Y)$, we get

\[ \int_{X \times Y} \psi(x)\varphi(y)\, d\pi = \int_X \psi(x) \left( \int_Y \varphi(y)\, df(x) \right) d\mu = \lim_{n \to +\infty} \int_X \psi(x) \left( \int_Y \varphi(y)\, df_n(x) \right) d\mu = \lim_{n \to +\infty} \int_{X \times Y} \psi(x)\varphi(y)\, d\pi_n. \tag{2.5} \]

By density of continuous functions, the above limit holds for $\psi \in L^1(X, \mu)$ as well.
Therefore, if the cost function has the form $c(x, y) = a(x)b(y)$, with $a \in L^1$, $b \in C$,
then by (2.5) it follows that $\pi$ is an optimal transport plan. Arguing component-wise,
the same reasoning applies to linear costs $c(x, y) = \langle a(x), y \rangle$. For a Carathéodory cost
function $c(x, y)$, i.e. a Borel map such that $c(x, \cdot)$ is continuous, observe that the
disintegration maps $f_n : X \to \mathcal{P}(Y) \subset \mathcal{M}(Y, \mathbb{R})$ belong to $L^\infty(X, \mathcal{M}(Y, \mathbb{R}))$, which
is the dual of $L^1(X, C(Y))$. Therefore, by passing to a subsequence we may suppose
that $f_n \rightharpoonup^* f$, i.e.

\[ \lim_{n \to +\infty} \int_X \left( \int_Y \psi(x, y)\, df_n(x) \right) d\mu = \int_X \left( \int_Y \psi(x, y)\, df(x) \right) d\mu \qquad \forall \psi \in L^1(X, C(Y)). \]

The above continuity property shows that $\pi = f \otimes \mu$ is an optimal transport plan,
provided that $\int_X \sup_y c(x, y)\, d\mu < +\infty$.

If $c(x, y)$ is a normal cost, i.e. a Borel measurable map such that $c(x, \cdot)$ is lower
semicontinuous, then it can be reduced to a Carathéodory cost by standard approximation
procedures. For instance (see [10]), we may write

\[ c(x, y) = \sup_h a_h(x)\, b_h(y), \qquad b_h \in C(Y). \]

Hence, the cost $c_j = \sup_{h \le j} a_h b_h$ is Carathéodory and $c_j \nearrow c$. Since

\[ \int_{X \times Y} c_j(x, y)\, d\pi_n \le \int_{X \times Y} c(x, y)\, d\pi_n, \]

passing to the limit we obtain

\[ \int_{X \times Y} c_j(x, y)\, d\pi \le \liminf_{n \to +\infty} \int_{X \times Y} c(x, y)\, d\pi_n. \]

Passing to the limit with respect to $j$ we get that $\pi$ is optimal. By representation
of weakly* l.s.c. functionals (see [1]), the same reasoning applies to normal costs on
$X \times \mathcal{P}(Y)$. For a related result see also [13].

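In the following we compare the Kantorovich problem (0.2) with the abstract version
formulated using the transport classes.

Lemma 2.5. For every transport class $\Lambda \subset \Pi(\mu, \nu)$ we have

\[ MK(c, \mu, \nu) \le MK(\tilde{c}, \mu, \Lambda). \]

Proof. Let $\pi = \mathcal{N}(x) \otimes \mu \in \Pi(\mu, \Lambda)$. We compute

\[ \int_{X \times \mathcal{P}(Y)} \tilde{c}\, d\pi = \int_X \left( \int_{\mathcal{P}(Y)} \left( \int_Y c(x, y)\, d\lambda \right) d\mathcal{N}(x) \right) d\mu = \int_X \left( \int_Y c(x, y)\, df(x) \right) d\mu = \int_{X \times Y} c(x, y)\, d(f \otimes \mu), \tag{2.6} \]

where $f(x) = \int_{\mathcal{P}(Y)} \lambda\, d\mathcal{N}(x)$. It remains to check that $(\pi^2)_\#(f \otimes \mu) = \nu$. By (2.1), for
every $\varphi \in C(Y)$ we have

\[ \int_X \left( \int_Y \varphi(y)\, df(x) \right) d\mu = \int_X \left( \int_{\mathcal{P}(Y)} \left( \int_Y \varphi(y)\, d\lambda \right) d\mathcal{N}(x) \right) d\mu = \int_{X \times \mathcal{P}(Y)} I_\varphi(\lambda)\, d\pi \]
\[ = \int_{\mathcal{P}(Y)} I_\varphi(\lambda)\, d\Lambda = \int_{\mathcal{P}(Y)} \left( \int_Y \varphi(y)\, d\lambda \right) d\Lambda = \int_Y \varphi(y)\, d\nu. \]

□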



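Observe that the above definition of $f(x) = \beta(\mathcal{N}(x))$ can be seen as a generalized
barycenter map. Indeed, we have the following

Lemma 2.6. The generalized barycenter map $\beta : \mathcal{P}(\mathcal{P}(Y)) \to \mathcal{P}(Y)$ defined by

\[ \beta(\mathcal{N}) = \int_{\mathcal{P}(Y)} \lambda\, d\mathcal{N} \]

is 1-Lipschitz with respect to the Wasserstein distance.

Proof. First observe that if $\varphi \in \mathrm{Lip}_1(Y)$ then $I_\varphi \in \mathrm{Lip}_1(\mathcal{P}(Y))$. For fixed $\varphi \in \mathrm{Lip}_1(Y)$
we get

\[ \int_Y \varphi\, d(\beta(\mathcal{N}_1) - \beta(\mathcal{N}_2)) = \int_{\mathcal{P}(Y)} \left( \int_Y \varphi\, d\lambda \right) d(\mathcal{N}_1 - \mathcal{N}_2) = \int_{\mathcal{P}(Y)} I_\varphi(\lambda)\, d(\mathcal{N}_1 - \mathcal{N}_2) \le W(\mathcal{N}_1, \mathcal{N}_2). \]

Taking the supremum with respect to $\varphi \in \mathrm{Lip}_1(Y)$ we obtain

\[ W(\beta(\mathcal{N}_1), \beta(\mathcal{N}_2)) \le W(\mathcal{N}_1, \mathcal{N}_2). \]

□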

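Observe that a probability measure $\Lambda$ on $\mathcal{P}(Y)$ defines a transport class iff its
generalized barycenter is equal to $\nu$.

Lemma 2.7. If $c$ is normal then there exists a transport class $\Lambda \subset \Pi(\mu, \nu)$ such that

\[ MK(c, \mu, \nu) = MK(\tilde{c}, \mu, \Lambda). \]

Proof. Let $\pi_n = f_n \otimes \mu \in \Pi(\mu, \nu)$ be a minimizing sequence for $MK(c, \mu, \nu)$. Set
$\Lambda_n = (f_n)_\#\mu$. By passing to a subsequence we have that $\Lambda_n \rightharpoonup \Lambda$ with respect to the
weak convergence of measures. Observe that

\[ \int_{\mathcal{P}(Y)} \left( \int_Y \varphi\, d\lambda \right) d\Lambda = \int_{\mathcal{P}(Y)} I_\varphi\, d\Lambda = \lim_{n \to +\infty} \int_{\mathcal{P}(Y)} I_\varphi\, d\Lambda_n = \lim_{n \to +\infty} \int_X I_\varphi(f_n(x))\, d\mu \]
\[ = \lim_{n \to +\infty} \int_X \left( \int_Y \varphi(y)\, df_n(x) \right) d\mu = \int_Y \varphi\, d\nu. \]

Therefore, $\Lambda$ defines a transport class. Consider the transport plans $\tilde{\pi}_n = (I \times f_n)_\#\mu \in
\Pi(\mu, \Lambda_n)$. Passing to a subsequence we may also suppose that $\tilde{\pi}_n \rightharpoonup \tilde{\pi} \in
\Pi(\mu, \Lambda)$ weakly. Since $\tilde{c}$ is normal we get

\[ MK(\tilde{c}, \mu, \Lambda) \le \int_{X \times \mathcal{P}(Y)} \tilde{c}\, d\tilde{\pi} \le \liminf_{n \to +\infty} \int_{X \times \mathcal{P}(Y)} \tilde{c}\, d\tilde{\pi}_n \]
\[ = \liminf_{n \to +\infty} \int_X \tilde{c}(x, f_n(x))\, d\mu = \liminf_{n \to +\infty} \int_X \left( \int_Y c(x, y)\, df_n(x) \right) d\mu = MK(c, \mu, \nu). \]

The result follows by Lemma 2.5. □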



By the above analysis the Kantorovich problem over $X \times \mathcal{P}(Y)$ is essentially equivalent
to the usual Kantorovich one. Indeed, if $c$ is normal (for instance if the cost $c$
satisfies the conditions of Lemma 2.4), by Lemma 2.7 we have $MK(c, \mu, \nu) = MK(\tilde{c}, \mu, \Lambda)$
for a transport class $\Lambda$. If $\pi = \mathcal{N}(x) \otimes \mu$ is an optimal transport plan for $MK(\tilde{c}, \mu, \Lambda)$, for such a
transport class $\Lambda$, setting $f(x) = \int_{\mathcal{P}(Y)} \lambda\, d\mathcal{N}(x)$, by (2.6) it follows that $f(x) \otimes \mu$ is
an optimal plan for $MK(c, \mu, \nu)$. For a related relaxation procedure see [12].

3. Existence and uniqueness for the Monge problem 

In the previous section we have seen that the Monge-Kantorovich problem is
essentially equivalent to the transport problem on transport classes. Of course, the
Monge problem proves hard to handle. For the linear cost $c(x, y) = \langle x, y \rangle$, which is
equivalent to the quadratic cost $c(x, y) = |x - y|^2$ because of the expansion $|x - y|^2 =
|x|^2 + |y|^2 - 2\langle x, y \rangle$ (minimizing the quadratic cost amounts to maximizing $\langle x, y \rangle$),
it is relatively easy to find existence and uniqueness of optimal
transport maps. We expect some advantage by considering the special form of the cost
$\tilde{c}$ for the Monge problem on a transport class. In order to handle the abstract
Monge problem, we review here some usual tools for solving the Monge problem. The
available approaches to existence and uniqueness rely more or less on two basic facts.
The first one is based on the notion of $c$-cyclical monotonicity. A set $S \subset X \times Y$
is said to be $c$-cyclically monotone if for any finite set of pairs $(x_1, y_1), \ldots, (x_k, y_k) \in S$ and any
permutation $\sigma$ the following inequality holds true

\[ \sum_{i=1}^{k} c(x_i, y_i) \le \sum_{i=1}^{k} c(x_i, y_{\sigma(i)}). \]
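For a finite set of pairs, the defining inequality can be tested by brute force. The sketch below is an illustration only: the function name `is_c_cyclically_monotone`, the tolerance `tol` and the sample data are assumptions, and the enumeration of all permutations is exponential in the number of pairs, so it is meant for tiny examples. Testing the full list of pairs is enough here, since a permutation of a sub-family extends to the full list by fixing the remaining indices, whose terms cancel on both sides.

```python
from itertools import permutations


def is_c_cyclically_monotone(pairs, c, tol=1e-12):
    """Brute-force test of c-cyclical monotonicity for a finite list of pairs.

    pairs: list of (x, y); c: cost function c(x, y).
    """
    xs = [x for x, _ in pairs]
    ys = [y for _, y in pairs]
    base = sum(c(x, y) for x, y in pairs)
    for sigma in permutations(range(len(pairs))):
        permuted = sum(c(xs[i], ys[j]) for i, j in enumerate(sigma))
        if base > permuted + tol:
            return False
    return True


def quadratic(x, y):
    return (x - y) ** 2


# Hypothetical data on the real line.
monotone_pairs = [(0, 0), (1, 2), (3, 5)]   # increasing pairing
crossed_pairs = [(0, 5), (3, 0)]            # pairing that crosses

print(is_c_cyclically_monotone(monotone_pairs, quadratic))
print(is_c_cyclically_monotone(crossed_pairs, quadratic))
```

For the quadratic cost on the line, the increasing pairing passes the test while the crossed one fails, since swapping its targets lowers the total cost.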

A fundamental fact in mass transportation is that the support of every optimal
transport plan is a $c$-cyclically monotone set and every $c$-cyclically monotone set is contained
in the $c$-superdifferential $\partial_c\psi$ (or contact set) of a $c$-concave function $\psi$, where

\[ \partial_c\psi(x) = \{ y \in Y \;:\; \psi(x') - \psi(x) \le c(x', y) - c(x, y) \quad \forall x' \in X \}. \tag{3.1} \]

A function $\psi$ is said to be $c$-concave if there exists $A \times B \subset Y \times \mathbb{R}$ such that

\[ \psi(x) = \inf_{(y, t) \in A \times B} c(x, y) + t. \]

For details we refer the reader for instance to [19], [22]. The $c$-transform of $\psi$ is defined
by $\psi^c(y) = \inf_{x \in X} \{ c(x, y) - \psi(x) \}$. It can be shown that $y \in \partial_c\psi(x) \iff \psi(x) + \psi^c(y) =
c(x, y)$. If one is able to show that for $\mu$-a.e. $x \in X$ the $c$-superdifferential is single
valued, then every optimal transport plan is supported on the graph of a transport map (see
[3], [19], [22]). Namely, there exists a unique solution of the Monge problem. The same
reasoning applies directly to the abstract problem $M(\tilde{c}, \mu, \Lambda)$. In other words,
if the $\tilde{c}$-superdifferentials are single-valued, then the Monge problem $M(\tilde{c}, \mu, \Lambda)$ admits
a unique solution. In this framework the two Monge problems are essentially related.
Indeed, suppose that the c-superdifferentials contain just one Dirac delta and let ip be 
a c-concave function. We set 

A = {5 y : y G A} x B C V(Y) x R 

and ip(x) = ini^ xt - )e ^ xB c(x, X) + t. It follows that ip is c-concave and ip(x) = i/j(x). 
Recalling that c(x, 5 y ) = c(x,y), the following implications hold true 

y G d c vp(x) vp(x') — ip(x) < c(x' , y) — c(x, y) <^ 

^(X') - ljj(x) < C(X', 5y) - C(X, 5y) Sy G ^^(x). 

Therefore d c ip{x) = {y}. 

Vice-versa, suppose that the $c$-superdifferentials are single valued and let $\tilde\psi$ be a
$\tilde c$-concave function. Consider the $c$-transform
$(\tilde\psi)^c(y) = \inf_x \{c(x, y) - \tilde\psi(x)\}$. We have

$$\delta_y \in \partial^{\tilde c}\tilde\psi(x) \iff \tilde\psi(x') - \tilde\psi(x) \le c(x', y) - c(x, y) \iff c(x, y) - \tilde\psi(x) \le c(x', y) - \tilde\psi(x')$$
$$\iff c(x, y) - \tilde\psi(x) = (\tilde\psi)^c(y) \implies y \in \partial^c\tilde\psi(x) \subset \partial^c u(x)$$

for a $c$-concave function $u$ (see for instance [ISl Remark 3.12, Theorem 3.10]). It follows
that $\partial^{\tilde c}\tilde\psi(x)$ contains just one delta. Of course this singleton condition on the
superdifferentials can be achieved under additional requirements on the cost function. For a
differentiable cost a general condition is the so-called twist condition (or Spence-Mirrlees
condition in economic settings), i.e.

$$x \mapsto c(x, y_1) - c(x, y_2) \ \text{ has no critical point } \ \forall\, y_1 \ne y_2 \tag{3.2}$$

(see [11, 15, 8]). For a generalization of this condition in the case of suitable
geometries see [2]. In the case $X = Y = M$ with $M$ a Riemannian manifold and for a
Lagrangian cost, it is enough for the cost $c$ to satisfy Mather's shortening principle
and the connectedness of the $c$-superdifferential, as shown in [22, Chapter 9].
Observe that in this case the connectedness is a key property. This property is easily
satisfied if $c(x, \cdot)$ is linear, as happens for the cost $c(x, y) = \langle x, y \rangle$. However, general
forms of the cost $c$ which guarantee the connectedness of the $c$-superdifferential are
not known. In this perspective, the consideration of the cost $\tilde c$, which is linear in the
measure variable, could be useful. Observe that for a cost $c$ linear with respect to the second
variable, the twist condition is not in general satisfied. Consider for instance
$c(x, y) = \langle a(x), y \rangle$ with a possibly non-invertible Jacobian matrix $\nabla a(x)$.
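The failure of the twist condition for a cost that is linear in the second variable can be observed numerically. The sketch below is a hypothetical illustration: the map $a(x) = (x_1^2, x_2)$, whose Jacobian is singular on $\{x_1 = 0\}$, the sample points and the finite-difference helper `grad_difference` are assumptions of this example and do not come from the paper.

```python
def grad_difference(cost, x, y1, y2, h=1e-6):
    """Central-difference gradient of x -> cost(x, y1) - cost(x, y2)."""
    grad = []
    for i in range(len(x)):
        x_plus, x_minus = list(x), list(x)
        x_plus[i] += h
        x_minus[i] -= h
        diff_plus = cost(x_plus, y1) - cost(x_plus, y2)
        diff_minus = cost(x_minus, y1) - cost(x_minus, y2)
        grad.append((diff_plus - diff_minus) / (2 * h))
    return grad


def quadratic(x, y):
    return sum((xi - yi) ** 2 for xi, yi in zip(x, y))


def bilinear_a(x, y):
    # c(x, y) = <a(x), y> with a(x) = (x_1^2, x_2); Jacobian diag(2 x_1, 1).
    a = (x[0] ** 2, x[1])
    return a[0] * y[0] + a[1] * y[1]


x0 = [0.0, 0.3]                    # a point where the Jacobian of a is singular
y1, y2 = (1.0, 0.0), (0.0, 0.0)    # two distinct targets

g_quadratic = grad_difference(quadratic, x0, y1, y2)   # close to -2 (y1 - y2)
g_bilinear = grad_difference(bilinear_a, x0, y1, y2)   # vanishes: critical point
```

For the quadratic cost the gradient of $x \mapsto c(x, y_1) - c(x, y_2)$ equals $-2(y_1 - y_2)$ and never vanishes, whereas for the bilinear cost it vanishes at `x0` although $y_1 \ne y_2$.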
3.1. Monge-Mather's shortening principle. By considering the cost $\tilde c$, it may
happen that the $\tilde c$-superdifferential contains many points, which actually are
probability measures of $\mathcal{P}(Y)$, although the $c$-superdifferential is a singleton. However, we
have the advantage that the $\tilde c$-superdifferentials are convex sets. Consider $X = Y = M$
and a cost $c$ satisfying a shortening principle. We briefly sketch the reasoning of [22,
Chapter 9]. Suppose that the following conditions are satisfied:

(1) There exists $D \subset M$ with $\mu(D) = 0$ such that $D$ intersects every nontrivial
continuous curve over $M$.

(2) The cost $c$ satisfies a shortening principle.

(3) The superdifferential $\partial^c\psi$ is connected.

By assumption (2) it is possible to define a function $F : \gamma^{x,y}(\tfrac12) \mapsto x$ with
$y \in \partial^c\psi(x)$, having as domain the midpoints of geodesics over $M$. Indeed, by definition of
$c$-superdifferential we get

$$\psi(x_1) - \psi(x) \le c(x_1, y) - c(x, y), \qquad \psi(x) - \psi(x_1) \le c(x, y_1) - c(x_1, y_1)$$

for every $y \in \partial^c\psi(x)$, $y_1 \in \partial^c\psi(x_1)$. It follows

$$c(x_1, y_1) - c(x, y_1) \le c(x_1, y) - c(x, y) \implies c(x_1, y_1) + c(x, y) \le c(x_1, y) + c(x, y_1).$$

Hence, the shortening principle implies that

$$d(x, x_1) \le C\, d\big(\gamma^{x,y}(\tfrac12), \gamma^{x_1,y_1}(\tfrac12)\big). \tag{3.3}$$

By the inequality (3.3) it follows that $F$ is well defined and moreover it is a Lipschitz
map (actually a Hölder condition works as well). By conditions (1) and
(3) it is possible to show that the set of points on which $\partial^c\psi$ is not single valued is of
null measure. Indeed, let $y_1, y_2 \in \partial^c\psi(x)$. By condition (3) consider a continuous curve
$p_t$ lying in $\partial^c\psi(x)$ connecting $y_1, y_2$. Therefore the non-trivial continuous
curve $m_t = \gamma^{x,p_t}(\tfrac12)$ is defined. Hence, $x = F(m_t)$. By point (1) it follows that $x \in F(D)$,
which is a null measure set. Since $c(x, y) = \tilde c(x, \delta_y)$, we would like to prove that the
$\tilde c$-superdifferentials are a.e. single valued. The analogous condition for the
$\tilde c$-superdifferential is

$$\tilde c(x_1, \lambda_1) + \tilde c(x, \lambda) \le \tilde c(x_1, \lambda) + \tilde c(x, \lambda_1).$$



By definition, the above inequality leads to

$$MK(c, \delta_{x_1}, \lambda_1) + MK(c, \delta_x, \lambda) \le MK(c, \delta_{x_1}, \lambda) + MK(c, \delta_x, \lambda_1).$$



A shortening principle should allow one to estimate the distance $d(x, x_1)$ by the distance
between midpoints of geodesics in $M$ or in $\mathcal{P}(M)$. The main problem is to find a null
set $D$, due to the fact that we are now dealing with the space $M \times \mathcal{P}(M)$.
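For discrete measures the cost on $X \times \mathcal{P}(Y)$ obtained by integrating $c(x, \cdot)$ against a measure, which by the inequality above coincides with $MK(c, \delta_x, \lambda)$, reduces to a weighted sum. The following sketch is a hedged illustration: the names `lifted_cost` and `cross_inequality_holds`, the quadratic cost and the sample measures are assumptions introduced for this example.

```python
def lifted_cost(cost, x, lam):
    """Integral of cost(x, .) against a discrete measure lam = [(y, weight), ...]."""
    return sum(w * cost(x, y) for y, w in lam)


def cross_inequality_holds(cost, x, lam, x1, lam1, tol=1e-12):
    """Check  c~(x1, lam1) + c~(x, lam) <= c~(x1, lam) + c~(x, lam1)."""
    lhs = lifted_cost(cost, x1, lam1) + lifted_cost(cost, x, lam)
    rhs = lifted_cost(cost, x1, lam) + lifted_cost(cost, x, lam1)
    return lhs <= rhs + tol


def quadratic(x, y):
    # Illustrative cost (an assumption of this sketch).
    return (x - y) ** 2


dirac_0 = [(0, 1.0)]
dirac_1 = [(1, 1.0)]
mixture = [(0, 0.5), (2, 0.5)]

value_on_dirac = lifted_cost(quadratic, 3, dirac_1)     # same as quadratic(3, 1)
value_on_mixture = lifted_cost(quadratic, 1, mixture)   # 0.5 * 1 + 0.5 * 1
aligned = cross_inequality_holds(quadratic, 0, dirac_0, 1, dirac_1)
crossed = cross_inequality_holds(quadratic, 0, dirac_1, 1, dirac_0)
```

On Dirac masses the weighted sum returns the original cost, and the cross inequality holds for the aligned assignment but fails when the two target measures are exchanged.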






3.2. Twist condition. Assume the cost function $c$ satisfies the twist condition (3.2).
Under a differentiability requirement, for instance on an open and bounded set $\Omega \subset \mathbb{R}^N$,
with a uniform Lipschitz condition

$$|c(x, y) - c(x', y)| \le K|x - x'|, \quad \forall y \in Y, \tag{3.4}$$

this means that the map $y \mapsto \nabla_x c(x, y)$ is injective. It turns out that $c$-concave
functions $\psi$ are Lipschitz on $\Omega$. Indeed, it suffices to compute

$$\psi(x_2) = \inf\{c(x_2, y) + t : (y, t) \in A \times B\}$$
$$= \inf\{c(x_2, y) - c(x_1, y) + c(x_1, y) + t : (y, t) \in A \times B\} \le K|x_1 - x_2| + \psi(x_1).$$

Moreover, if $\psi$ is differentiable at $x \in \Omega$, for every $y \in \partial^c\psi(x)$, we have

$$\frac{\psi(x + tv) - \psi(x)}{t} \le \frac{c(x + tv, y) - c(x, y)}{t}.$$

Passing to the limit as $t \to 0^+$ we get

$$\langle \nabla\psi(x) - \nabla_x c(x, y), v \rangle \le 0.$$

By the arbitrariness of $v$ it follows $\nabla\psi(x) = \nabla_x c(x, y)$ (see also [8]). Since $\psi$ is
Lipschitz, it is differentiable almost everywhere with respect to the Lebesgue measure by
Rademacher's theorem. Therefore, if $\mu$ is absolutely continuous with respect to the Lebesgue
measure, then the twist condition implies that the superdifferential $\partial^c\psi(x)$ is a singleton
for $\mu$-a.e. $x \in \Omega$. The same reasoning applies directly to the cost $\tilde c$. Indeed, since
$\nabla_x \tilde c(x, \lambda) = \int_Y \nabla_x c(x, y)\, d\lambda$ we have that the $\tilde c$-superdifferentials contain
at most one delta iff the cost $c$ satisfies the twist condition.
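The identity $\nabla\psi(x) = \nabla_x c(x, y)$ for $y \in \partial^c\psi(x)$ can be checked numerically in a one-dimensional toy case. The sketch below is an illustration under stated assumptions: the quadratic cost, the two pairs $(y, t)$ defining $\psi$ and the helper names are hypothetical, and the point $x = 0.5$ is chosen so that the minimizing $y$ is unique and $\psi$ is differentiable there.

```python
def psi(x, pairs):
    """c-concave function for the quadratic cost: min over (y, t) of (x - y)^2 + t."""
    return min((x - y) ** 2 + t for y, t in pairs)


def contact_point(x, pairs):
    """The y attaining the minimum in psi(x); such a y lies in the c-superdifferential at x."""
    return min(pairs, key=lambda p: (x - p[0]) ** 2 + p[1])[0]


def derivative(f, x, h=1e-6):
    """Central finite difference."""
    return (f(x + h) - f(x - h)) / (2 * h)


pairs = [(0.0, 0.0), (2.0, 0.0)]
x = 0.5
y_star = contact_point(x, pairs)
grad_psi = derivative(lambda z: psi(z, pairs), x)
grad_cost = 2 * (x - y_star)    # d/dx of (x - y)^2 evaluated at y = y_star
```

Since $y \mapsto 2(x - y)$ is injective, the value of $\nabla\psi(x)$ determines the contact point, which is the mechanism behind the singleton property in this example.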

In the sequel we sketch a related approach to show existence and uniqueness for the 
Monge problem. 

Assume that $c(x, y)$ is a Carathéodory function, hence by Lemma 2.4 $\tilde c$ is continuous.
Approximate the target measure $\nu$ by a finite convex combination of Dirac deltas, say
by $\nu_n \in \mathcal{P}(Y)$. Consider the problem $MK(c, \mu, \nu_n)$. The optimal transport plans of
this approximation problem are of course extremal points of $\Pi(\mu, \nu_n)$. It can be shown
that these extremal points, which are supported on the $c$-superdifferential of a $c$-concave
function $\psi_n$, are of the form $(Id \times t_n)_\#\mu$ (see section 4), i.e. the Monge problem
$M(c, \mu, \nu_n)$ admits a unique solution.

By the uniform Lipschitz condition (3.4) it follows that the sequence of $c$-concave
functions $\psi_n$ is equi-Lipschitz. Indeed, fixed $x, x' \in \Omega$, for $y_n \in \partial^c\psi_n(x)$ we have

$$\psi_n(x') - \psi_n(x) \le c(x', y_n) - c(x, y_n) \le K|x - x'|,$$

while for $y'_n \in \partial^c\psi_n(x')$ we have

$$\psi_n(x) - \psi_n(x') \le c(x, y'_n) - c(x', y'_n) \le K|x - x'|.$$

By the Ascoli-Arzelà Theorem we may suppose that $\psi_n \to \psi$ uniformly on compact
subsets. Observe that $\psi$ is $c$-concave as well. Indeed, recall that a map $\psi$ is $c$-concave
iff $\psi = \psi^{cc}$, where

$$\psi^{cc}(x) = \inf\{f(x) : \psi \le f,\ f \text{ is } c\text{-concave}\}.$$
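Fixed $\varepsilon > 0$, we find a large integer $n$ such that $|\psi_n(x) - \psi(x)| < \varepsilon$ for every $x$.
Since $\psi_n$ is $c$-concave we get

$$\psi(x) \le \varepsilon + \psi_n(x) \implies \psi^{cc}(x) \le \psi_n(x) + \varepsilon \le \psi(x) + 2\varepsilon.$$

By the arbitrariness of $\varepsilon$ we obtain $\psi^{cc} \le \psi$. Since $\psi \le \psi^{cc}$ by definition, $\psi$ is $c$-concave.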
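The characterization of $c$-concavity through the double $c$-transform can be illustrated on a finite grid, where the infima become minima. The sketch below is a hypothetical example and not part of the paper: the grid on $[0, 2]$, the quadratic cost and the two test functions are assumptions, and the double transform is computed as $\min_y \{c(x, y) - \psi^c(y)\}$.

```python
def c_transform(phi, cost, points):
    """Return y -> min over x in points of cost(x, y) - phi(x)."""
    return lambda y: min(cost(x, y) - phi(x) for x in points)


def double_transform(phi, cost, X, Y):
    """Return x -> min over y in Y of cost(x, y) - phi^c(y)."""
    phi_c = c_transform(phi, cost, X)
    return lambda x: min(cost(x, y) - phi_c(y) for y in Y)


def quadratic(x, y):
    # Illustrative cost (an assumption of this sketch).
    return (x - y) ** 2


grid = [i / 4 for i in range(9)]    # used both as X and as Y


def psi(x):
    # c-concave by construction: minimum of c(x, 0) and c(x, 2).
    return min(quadratic(x, 0.0), quadratic(x, 2.0))


def phi(x):
    # Not c-concave for the quadratic cost: phi(x) - x^2 is convex.
    return 2 * x ** 2


psi_cc = double_transform(psi, quadratic, grid, grid)
phi_cc = double_transform(phi, quadratic, grid, grid)

psi_gap = max(abs(psi_cc(x) - psi(x)) for x in grid)    # zero up to rounding
phi_gap_at_0 = phi_cc(0.0) - phi(0.0)                   # strictly positive
```

In this example the double transform reproduces the $c$-concave function on the grid, while for the second function it returns a strictly larger value at $x = 0$.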

Let $y_n = t_n(x) \in \partial^c\psi_n(x)$. By passing to a subsequence we may suppose that
$y_n \to y$. Since

$$\psi_n(x') - \psi_n(x) \le c(x', y_n) - c(x, y_n) \le K|x - x'|,$$

passing to the limit as $n \to +\infty$ we get

$$\psi(x') - \psi(x) \le c(x', y) - c(x, y).$$

Therefore $y \in \partial^c\psi(x)$. Since the $c$-superdifferential is a singleton, the whole sequence
$y_n$ converges to $y$. We set $t(x) = y$. As limit of measurable maps, $t$ is measurable as
well. Moreover, for every continuous function $f$ on $Y$,

$$\int f(t(x))\, d\mu = \lim_{n} \int f(t_n(x))\, d\mu = \lim_{n} \int f(y)\, d\nu_n = \int f(y)\, d\nu.$$

Hence $t_\#\mu = \nu$. Since the graph $(x, t(x))$ is supported on $\partial^c\psi(x)$ it follows, see [El
Th. 3.22], that $t$ is an optimal transport map. For a related approximation procedure
of the Kantorovich problem see [13].

Hence, under the twist condition for the cost $\tilde c$, we would have existence and uniqueness
of optimal transport plans in every fixed transport class. Therefore, an interesting
question is that of finding conditions on the cost $c$ ensuring the twist condition for $\tilde c$.

4. Existence in discrete transport classes 

To treat discrete measures we sketch existence for the Monge problem in this setting. 
For an extensive discussion of this case we refer to [TJ HI [18] . 

Let $\nu = \sum_i a_i \delta_{y_i}$ be a discrete probability measure over a metric space $Y$. Let $\gamma$ be
an optimal transport plan between $\mu$ and $\nu$. Denote by $\Gamma = \mathrm{supp}(\gamma)$. By optimality,
we have that $\Gamma$ is a $c$-cyclically monotone subset of $X \times Y$. Define
$A_{ij} = \{x \in X : (x, y_i), (x, y_j) \in \Gamma\}$. Let $x, x' \in A_{ij}$. By $c$-cyclical monotonicity we get

$$c(x, y_i) + c(x', y_j) \le c(x, y_j) + c(x', y_i) \le c(x, y_i) + c(x', y_j).$$

Therefore, we obtain

$$c(x, y_i) - c(x, y_j) = c(x', y_i) - c(x', y_j) = \lambda_{ij}.$$

Under the assumption that $\mu$ is $c$-continuous, i.e.

$$\mu(\{x \in X : c(x, y_i) - c(x, y_j) = \lambda_{ij}\}) = 0,$$

for every $y_i \ne y_j$ in $Y$ and $\lambda_{ij} \in \mathbb{R}$, we have that $\mu(A_{ij}) = 0$. Therefore, since the set of
points where mass splits is given by $A = \bigcup_{ij} A_{ij}$, if $\mu$ is $c$-continuous then $\mu(A) = 0$. Hence,
the transport plan $\gamma$ is induced by a transport map.
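The sets $A_{ij}$ of points whose mass is split between $y_i$ and $y_j$ can be read off from the support of a finitely supported plan. The following sketch is a hedged illustration: the function names, the cost $c(x, y) = x^2 y$ and the four-point support are hypothetical choices, made so that the support is $c$-cyclically monotone and mass splits at two different points.

```python
from collections import defaultdict


def splitting_sets(support):
    """Map each pair (y_i, y_j), y_i < y_j, to the set A_ij of x coupled with both."""
    targets = defaultdict(set)
    for x, y in support:
        targets[x].add(y)
    A = defaultdict(set)
    for x, ys in targets.items():
        ordered = sorted(ys)
        for a in range(len(ordered)):
            for b in range(a + 1, len(ordered)):
                A[(ordered[a], ordered[b])].add(x)
    return dict(A)


def level_values(A, cost):
    """For each A_ij, the set of values c(x, y_i) - c(x, y_j) taken on A_ij."""
    return {(yi, yj): {cost(x, yi) - cost(x, yj) for x in xs}
            for (yi, yj), xs in A.items()}


def cost(x, y):
    # Illustrative cost; on the support below c(x, y) = y, so every
    # rearrangement of the targets has the same total cost and the
    # support is c-cyclically monotone.
    return x * x * y


support = [(-1, 0), (-1, 1), (1, 0), (1, 1)]
A = splitting_sets(support)
levels = level_values(A, cost)
```

Here both points of $A_{01}$ lie on the same level set of $x \mapsto c(x, y_0) - c(x, y_1)$, in agreement with the identity above; a $c$-continuous $\mu$ would give zero mass to such a level set.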
4.1. Existence in some transport classes. Let $\Lambda$ be an atomic transport class.
Optimal transport plan for MK(c, fi, A) are of the form J^i ^/* ® I 2 ( see P21 fl3]). 
For every index it results fi(x) £ (9 c, 0(x). By linearity of c it follows /(a;) = a ifi e 
d c if)(x). Therefore, the transport map f(x) is optimal for M (c, /i, 

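4.2. Non-existence in some transport classes. Consider a discrete transport class
given by $\Lambda = \sum_i a_i \delta_{\lambda_i}$. Consider the cost $c(x, y) = \langle x, y \rangle$. For $i, j$ observe that

$$\tilde c(x, \lambda_i) - \tilde c(x, \lambda_j) = \Big\langle x, \int_Y y\, d(\lambda_i - \lambda_j) \Big\rangle.$$

Therefore, $M(\tilde c, \mu, \Lambda)$ admits a solution iff $\beta(\lambda_i) \ne \beta(\lambda_j)$, where
$\beta(\lambda) = \int_Y y\, d\lambda$ denotes the barycenter of $\lambda$. Analogously, for a cost $a(x)b(y)$,
if $\mu(\{x \in X : a(x) = k\}) = 0$ for every $k \in \mathbb{R}$, it turns out that $M(\tilde c, \mu, \Lambda)$ admits
a solution iff $\int_Y b(y)\, d(\lambda_i - \lambda_j) \ne 0$ for every $i \ne j$.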
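For the cost $c(x, y) = \langle x, y \rangle$ the difference of the integrated costs depends on the two measures only through their barycenters, which can be checked on discrete measures in the plane. The sketch below is an illustration under stated assumptions: the three measures, the sample points and the helper names are hypothetical.

```python
def inner(x, y):
    return sum(a * b for a, b in zip(x, y))


def lifted_cost(x, lam):
    """Integral of <x, y> against the discrete measure lam = [(y, weight), ...]."""
    return sum(w * inner(x, y) for y, w in lam)


def barycenter(lam):
    dim = len(lam[0][0])
    return tuple(sum(w * y[k] for y, w in lam) for k in range(dim))


lam_i = [((0.0, 0.0), 0.5), ((2.0, 0.0), 0.5)]   # barycenter (1, 0)
lam_j = [((1.0, 0.0), 1.0)]                      # same barycenter, different measure
lam_k = [((0.0, 1.0), 1.0)]                      # barycenter (0, 1)

samples = [(1.0, 0.0), (0.0, 1.0), (-2.0, 3.0)]
gap_ij = [lifted_cost(x, lam_i) - lifted_cost(x, lam_j) for x in samples]
gap_ik = [lifted_cost(x, lam_i) - lifted_cost(x, lam_k) for x in samples]
```

When the barycenters coincide the difference vanishes identically, so the corresponding level set is the whole space and cannot be negligible; with distinct barycenters the difference is a nonzero linear function of $x$, whose level sets are hyperplanes.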

In this section we have seen that the usual approaches to solving the Monge problem
give rise to some difficulties in the setting of the transport problem in a transport
class. In some sense, these methods are specific to the transport class corresponding
to transport maps. The question of establishing existence in different transport classes
remains open. We have shown that even in the case of a discrete transport class the
answer could be negative. Therefore, from this point of view, the Monge problem
corresponds to a lucky case for the fixed transport class. This feature of transport
classes also naturally leads to the following question. The existence results for the
Monge problem are usually stated in the following form: under some assumptions on
the spaces, on the first marginal $\mu$ and on the cost $c(x, y)$, for every second marginal
$\nu$ the Monge problem admits solutions. Having in mind the abstract Monge problem
$M(\tilde c, \mu, \Lambda)$, it could also be interesting to consider, under some assumptions on the
spaces, on the first marginal $\mu$ and on the cost $c(x, y)$, the question of which
second marginals make the corresponding Monge problem solvable.